CDS
Accession Number | TCMCG075C19520 |
gbkey | CDS |
Protein Id | XP_017978939.1 |
Location | join(19217481..19217657,19217915..19218010,19218133..19218245,19218333..19218399,19218570..19218662,19218831..19218974,19219520..19219608,19219733..19219838,19219919..19220006,19220190..19220308,19220401..19220565,19220855..19221033,19221187..19221296,19221608..19221717,19221810..19221920,19222002..19222112,19222375..19222580,19222670..19223006,19223346..19223456) |
Gene | LOC18596132 |
GeneID | 18596132 |
Organism | Theobroma cacao |
Protein
Length | 843aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018123450.1 |
Definition | PREDICTED: beta-galactosidase 1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0003674
[VIEW IN EMBL-EBI] GO:0003824 [VIEW IN EMBL-EBI] GO:0004553 [VIEW IN EMBL-EBI] GO:0004565 [VIEW IN EMBL-EBI] GO:0005575 [VIEW IN EMBL-EBI] GO:0005618 [VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005773 [VIEW IN EMBL-EBI] GO:0005911 [VIEW IN EMBL-EBI] GO:0009505 [VIEW IN EMBL-EBI] GO:0009506 [VIEW IN EMBL-EBI] GO:0015925 [VIEW IN EMBL-EBI] GO:0016787 [VIEW IN EMBL-EBI] GO:0016798 [VIEW IN EMBL-EBI] GO:0030054 [VIEW IN EMBL-EBI] GO:0030312 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0055044 [VIEW IN EMBL-EBI] GO:0071944 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGGACTCCAACTCCAAGCTTCCTGTAATGTGGAATGCTCTGTTGGTTCTTCTGTTTGCTTCCTGGGTTTGTTCAGTTTCTGCCTCTGTTTCCTATGACCGTAAGGCTATCACCATTAATGGGCAAAGAAGGATCCTCATTTCTGGATCCATTCACTACCCAAGAAGCTCACCTGAGATGTGGCCAGATCTTGTACAGAAGGCTAAAGAAGGAGGGCTAGATGTGATTCAGACTTATGTTTTTTGGAATGGCCATGAGCCTGCACCTGGCAAATATTACTTCCAAGGCAACTATGATTTGGTCAAATTTATTAAGCTGGTTCAGCAGGCAGGCCTCTATGTTCATCTGAGGATTGGTCCTTATGTCTGTGCTGAGTGGAACTTTGGGGGTTTTCCTGTTTGGCTGAAGTACATTCCCGGTATCAATTTCAGAACAAACAATGGACCTTTCAAGGCTCAAATGCAAAGATTTACAGAAAAGATTGTGGATATGATGAAAGCTGAAAGGTTGTTTGAGTCTCAAGGAGGTCCTATAATTCTATCTCAGATTGAAAATGAATATGGACCCATGGAATATGAACTTGGGGCACCCGGTAAAGCTTACACTGATTGGGCAGCTAAAATGGCTGTGGGACTAGGCACTGGTGTCCCATGGGTCATGTGCAAACAAGATGATGCACCCGATCCTATTATTAACACCTGCAATGGTTTCTACTGTGACTACTTTTCTCCTAACAAGGCCTACAAACCAAAGATATGGACTGAAGCCTGGACAGGCTGGTATACTGAGTTTGGAGGTGCAGTTCCTTACCGACCTGCTGAAGACTTGGCATTTTCAGTTGCAAGATTTATACAAAAAGGAGGAGCATTCATTAATTATTATATGTATCATGGAGGAACAAATTTTGGCCGAACTGCTGGGGGTCCTTTCATTGCTACTAGCTATGATTATGATGCTCCTCTTGATGAATATGGACTGTTGAGGCAACCCAAATGGGGCCATTTGAAAGATTTGCATAGAGCAATAAAACTCTGTGAACCAGCTTTAGTAAATGGAGATCCCACTGTGATGCGACTTGGAAACTATCAGGAGGCTCATGTATTCAAATATCAGTCTGGAGGTTGTGCTGCCTTCCTTGCAAATTACAACCCAAGATCTTTTGCAAAAGTTGCCTTTGGGAACATGCACTACAACCTGCCTCCTTGGTCTATCAGCATTCTTCCTGACTGCAAGAATACTGTGTATAACACTGCAAGGGTTGGTGCCCAAATTGCACGGAAGAAAATGGTTCCTGTTCCCATGCATGGAGCGTTCTCTTGGCAGGCATTCAGTGAAGAGACAGCTTCGGATGTTGACAGTTCATTCACAATGGTCGGATTGTTGGAGCAGATAAATACAACCAAAGATGCAACTGACTATTTGTGGTACACAACAGACATTAAGATTGACCCCAGTGAAGGATTCTTGAAGAATGGAAACTCTCCTGTTCTTACTATCTTATCAGCTGGCCATGCTTTGCATGTTTTTGTCAATGGTCAACTATCAGGAAGTGCCTATGGAAGTCTTGAATTCCCCAAACTAACATTCAGCCAAGGTGTTAATTTGAGAGCTGGTGTCAACAAAATTTCACTTTTGAGTATTGCTGTTGGTCTCCCAAATGTTGGTCCACATTTTGAGACATGGAATGCTGGTATTCTTGGCCCGGTTACATTGAATGGTCTCAATGAGGGAAGGAGAGATCTCTCATGGCAGAAATGGTCTTACAAGATTGGCCTTGAAGGAGAAGCATTGAATCTTCATTCACTAAGTGGTAGTTCCTCAGTGGAGTGGGCACAGGGGTCCTTTGTTGCACGAAGGCAGCCACTGATGTGGTATAAAACAACTTTCAATGCTCCAGCTGGAAATGCTCCGTTGGCTTTAGATATGCACAGTATGGGGAAAGGTCAGATTTGGATAAATGGACAGAGCATTGGACGCCACTGGCCTGCATATAAAGCATCTGGCAATTGTGGTGACTGTAATTATGCTGGAACATATGATGAGAAGAAATGTAGAACTAATTGTGGAGAGGCCTCTCAAGGATGGTATCACATTCCTCGTTCATGGCTCAACCCAACAGGGAATTTGTTGGTTGTGTTTGAGGAATGGGGTGGTGACCCCAATGCAATTTCTCTGGTCCGCAGAGAAACTGACAGTGTTTGTGCTGATATCTATGAGTGGCAACCAACTCTTATGAATTACCAGATGCAAGCCTCTGGTAAAGTCAACAAACCTCTAAGGCCAAAAGTTCATTTAGAGTGCGATGCTGGGCAGAAAATCTCGGCAGTAAAGTTTGCCAGCTTTGGAACGCCAGAAGGGGCCTGTGGAAGCTACTGTGAAGGAAGCTGCCATGCTCACCACTCTTATGATGCTTTTAATAGGCTTTGTGTTGGGCAGAACTTCTGCTCGGTGACTGTAGCACCTGAAATGTTTGGAGGAGACCCATGTCCTAGTGTCATGAAGAAACTCTCTGTGGAGGTCATTTGCAGCTGA |
Protein: MDSNSKLPVMWNALLVLLFASWVCSVSASVSYDRKAITINGQRRILISGSIHYPRSSPEMWPDLVQKAKEGGLDVIQTYVFWNGHEPAPGKYYFQGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGINFRTNNGPFKAQMQRFTEKIVDMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGKAYTDWAAKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKIWTEAWTGWYTEFGGAVPYRPAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVNGDPTVMRLGNYQEAHVFKYQSGGCAAFLANYNPRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQIARKKMVPVPMHGAFSWQAFSEETASDVDSSFTMVGLLEQINTTKDATDYLWYTTDIKIDPSEGFLKNGNSPVLTILSAGHALHVFVNGQLSGSAYGSLEFPKLTFSQGVNLRAGVNKISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLEGEALNLHSLSGSSSVEWAQGSFVARRQPLMWYKTTFNAPAGNAPLALDMHSMGKGQIWINGQSIGRHWPAYKASGNCGDCNYAGTYDEKKCRTNCGEASQGWYHIPRSWLNPTGNLLVVFEEWGGDPNAISLVRRETDSVCADIYEWQPTLMNYQMQASGKVNKPLRPKVHLECDAGQKISAVKFASFGTPEGACGSYCEGSCHAHHSYDAFNRLCVGQNFCSVTVAPEMFGGDPCPSVMKKLSVEVICS |